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ABSTRACT 

We introduce a new method to detect solar-like oscillations in frequency power spectra 
of stellar observations, under conditions of very low signal to noise. The Moving-Windowed- 
Power-Search, or MWPS, searches the power spectrum for signatures of excess power, over 
and above slowly varying (in frequency) background contributions from stellar granulation 
and shot or instrumental noise. We adopt a false-alarm approach (Chaplin et al. 2011) to 
ascertain whether flagged excess power, which is consistent with the excess expected from 
solar-like oscillations, is hard to explain by chance alone (and hence a candidate detection). 

We apply the method to solar photometry data, whose quality was systematically de- 
graded to test the performance of the MWPS at low signal-to-noise ratios. We also com- 
pare the performance of the MWPS against the frequently applied power-spectrum-of-power- 
spectrum (PS<g)PS) detection method. The MWPS is found to outperform the PS<g)PS method. 
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1 INTRODUCTION 

Asteroseismology on solar-like oscillators can provide important 
knowledge on the internal structure and evolution of these stars, and 
in the case of planetary systems this knowledge is needed for much 
of the analysis on the planets themselves. The analysis and interpre- 
tation of the stellar oscillations relies of course on the actual detec- 
tion of the signatures of the solar-like oscillations. These signatures 
manifest as a pattern of near regularly spaced peaks, and hence a 
power excess, in the frequency spectrum of the stellar time series. 
The advent of space-based photometric observations, such as those 
provided by the NASA Kepler Mission (Borucki et al. 2010; Koch 
et al. 2010), is providing large quantities of exquisite-quality data 
for asteroseismology (Gilliland et al. 2010). The first stage of as- 
teroseismic analysis of solar-type stars often now involves the ap- 
plication of automated analysis pipelines to search for signatures of 
stellar oscillations. These signatures may then be analysed in detail 
if a positive detection is obtained. 

Many of the methods exploit the near regularity (or periodic- 
ity) in frequency of the oscillation frequencies, for example from 
computation of the autocorrelation of the timeseries (equivalently 
the power spectrum of the power spectrum, or PS®PS) (see, e.g., 
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Hekker et al. 2010; Mosser & Appourchaux 2009), or the autocor- 
relation of the power spectrum (see, e.g., Campante et al. 2010; 
Karoff et al. 2010) and variants thereof (see, e.g., Verner & Rox- 
burgh 201 1 ; Huber et al. 2009; Mathur et al. 2010). These methods 
will have difficulty in detecting a signal if the regular structure is 
obscured, as for example in stars with many mixed-modes, or in 
stars where the stellar oscillation signal is buried in noise so that 
beating with the noise becomes significant. 

Here, we present a method that instead searches just for the 
power excess originating from stellar oscillations. This Moving- 
Windowed-Power-Search method (hereafter MWPS) uses a moving 
window in frequency, and a statistical significance test is employed 
to see if any excess power flagged over and above the expected 
sources of background noise could be caused solely by fluctuations 
of the noise. The method does not rely on detecting any regular fre- 
quency structure in the stellar oscillation spectrum and is shown to 
work extremely well under low signal to noise (low SNR) condi- 
tions. To see how the MWPS method performs compared to cur- 
rently used methods, we test it against the PS®PS method, which 
is one of the most commonly used techniques for extracting the 
large frequency separation. Both methods are tested on solar data, 
to which increasing amounts of white noise were added in the time 
series. 

The layout of the rest of the paper is as follows. In Section 2 
we introduce the principles of the MWPS method, and discuss in 
detail its implementation. The PSCgPS, with which we compare the 
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MWPS, is discussed in Section 3. We present results of testing both 
methods on solar data in Section 4. Concluding remarks are given 
in Section 5. 



2 MOVING-WINDOWED-POWER-SEARCH 
2.1 Overview 

The oscillation spectrum of a solar-type star consists of a series of 
regularly spaced peaks with heights modulated by a broad enve- 
lope. This envelope of power due to oscillations may be approx- 
imated by a Gaussian, with i/ mM defined as the frequency at the 
peak of the power envelope of the oscillations where the observed 
modes present their strongest amplitudes (Chaplin et al. 201 1). 

The power 1 due to the oscillations sits on a slowly varying 
(in frequency) background power that we assume is dominated by 
contributions from shot/instrumental noise, stellar granulation and 
activity. When the observed power in the oscillations relative to the 
background is high (i.e., high SNR), the power excess due to the 
oscillations will be clearly visible. However, at low SNR, statistical 
fluctuations in the background power may swamp the oscillations 
signal, so that the excess is much harder to see. 

The MWPS method searches the spectrum for the presence of 
excess power, over and above the slowly varying background. A 
statistical false-alarm (null hypothesis) test is applied to ascertain 
whether any flagged excess is hard to explain by chance alone, i.e., 
random fluctuations of the background. Such cases may be flagged 
as candidate detections, subject to some a priori constraints regard- 
ing the total excess power solar-like oscillations would be expected 
to show in the flagged frequency range (of which more in Sec- 
tion 2.3 below). 

The method is clearly reliant on providing an accurate esti- 
mate of the underlying background, since any excess is defined rel- 
ative to that background. The background fitting is discussed in 
detail below, in Section 2.4. First, we discuss the statistics of the 
MWPS (Section 2.2), and logistics relating to how the frequencies 
are searched for evidence of a power excess (Section 2.3). 



2.2 Statistics of the MWPS 

At each tested frequency we estimate a power-to-background ra- 
tio, PBRtot, over a pre-defined window in frequency. This ratio is 
defined according to 



PBR, 



tot 



Ptot 

Btot ' 



(1) 



where P to t is the total power across the test frequency window, 
i.e., it includes any excess power above the underlying fitted back- 
ground, and the contribution from the background; and Btot is me 
total power in the background, i.e., the area below the fitted back- 
ground curve 2 . Note that this definition of PBR to t corresponds to 
the value of SNR tot +l in Chaplin et al. 2011. The value of PBR tot 
is normalised in such a way that it will tend to PBR to t ~ 1 in the 
case of no excess power from oscillations. The motivation for us- 
ing a window instead of simply the entire power spectrum is, that 

1 The power spectrum was calculated using a sine-wave fitting method 
(see, e.g., Kjeldsen 1992, Frandsen et al. 1995) which is normalized accord- 
ing to the amplitude-scaled version of Parseval's theorem (see, Kjeldsen & 
Frandsen 1992), in which a sine wave of peak amplitude, A, will have a 
corresponding peak in the power spectrum of A 2 . 

2 Ptot and Btot was found by the sum of the discrete values 



by using a frequency range much larger than the one containing 
the excess power from oscillatons, the value of PBR to t would be 
much degraded, reducing the chances of making a detection. The 
tested frequencies and associated window widths are discussed in 
Section 2.3 below. 

We assume that the individual bins of the power spectrum fol- 
low X2 statistics, i.e. that they follow a chi-squared distribution 
with two degrees-of-freedom (d.o.f.). As the sum of two such inde- 
pendent distributions xl + Xm is y et another Xn distribution with 
n = k + m d.o.f., the sum of power from the N independent' 1 bins 
included in the envelope interval will then be distributed according 
to a x 2 distribution with 2iV-d.o.f. We may therefore compare the 
value of PBRtot with a x 2 2JV-d.o.f. statistic (Appourchaux et al. 
2004). The functional form of the x 2 distribution is given by: 



f{x;k) 



2 fe / 2 r(fc/2) * 



c/2 



(2) 



where k is the number of d.o.f., F is the gamma function and x is 
the calculated value of PBR to t (actually x — PBR to tfc). 

Knowing the distribution, it is possible to calculate a corre- 
sponding probability, or p va iue, for the obtained value of PBR to t, 
i.e. the probability of observing a value ^ x by chance, assuming 
the null hypothesis is true, i.e., 



Pvaiue = 1 - F(x; k) 

= 1—1 f(x; k)dx 



(3) 



where F(x\ k) is the cumulative x\ distribution. Needless to say, 
the lower the p V aiue the less likely it is that the observed power can 
be attributed to random fluctuations of the underlying background. 
If the pvaiue falls below a pre-defined significance level (or false 
alarm probability), a, the null hypothesis can be rejected and the 
observed power is then assumed to be statistically significant. Here, 
we have set a — 0.01. 

The threshold value of PBR to t required to reject the null hy- 
pothesis is given by: 



P(PBRtot > PBRt 



i) = a, 



(4) 



which amounts to finding the PBR at which p V ai uc = ct. In sum, the 
"false-alarm" approach means that every calculated p va i U c falling 
below a is flagged as marking the presence of significant excess 
power, which could be attributable to stellar oscillations. The pro- 
cedure thereby gives us a probability curve across the frequency 
interval for each of the selected windows. 



2.3 Tested frequencies and window widths 

We make no a priori assumptions on the true value of i/ max and 
thereby on the range of frequencies covered by any Gaussian power 
envelope that might be present. However, we do allow for the fact 
that the FWHM in frequency of the oscillation envelope is known 
to vary to good approximation as ^max/2 (see, e.g., Stello et al. 
2007; Chaplin et al. 2011), and so test for the presence of excess 
power over windows of the spectrum whose width in frequency is 
varied accordingly. 

The centre frequency of each test window may be regarded as 
a proxy for v max . When the MWPS is applied at low frequencies 
the width of the test window should be set fairly narrow, since any 
oscillation signal present will also be confined over a narrow range 



No oversampling is used in the computation of the power spectrum 
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of frequencies. A wide test window - i.e., one significantly wider 
than the oscillation power envelope - would reduce the underlying 
PBR, and hence reduce the chances of making a detection with the 
MWPS. The same logic dictates that the window width must be 
set much wider when the test is applied at higher frequencies (too 
narrow a window will result in signal being missed). 

Assuming a Gaussian-like power excess, there is notionally 
an optimum window width that will maximize the underlying PBR, 
which corresponds approximately to 1 . 17-times 4 the FWHM of the 
power envelope, which is 0.59 x v m „, i.e., 0.59-times the test fre- 
quency. This width may not always yield optimum results in prac- 
tice, due to the importance at low SNR of beating with background 
noise, and the impact of realization noise from any oscillations that 
are present. We make suitable allowance for this in our adopted 
search strategy. 

We begin by selecting 40 frequencies spread evenly in the 
range from 1000 to 5000 ^iHz (adopted range for solar-like oscil- 
lations). An optimum window width is calculated for each of these 
i/ mal proxy frequencies, giving a total of 40 windows with which 
to test the power spectrum. Each window is then moved through the 
power spectrum in steps set by a pre-defined lag frequency (over- 
laps, but not gaps, between frequency ranges encompassed by the 
respective windows are allowed), and the MWPS test is applied at 
each location. The check for excess power at each location involves 
first calculating the total power in the envelope interval set by the 
window width, which may then be compared to the total power 
found below the fitted background (see Section 2.4 below for de- 
tails of the background fitting). 

We do not vary the window size as each window is moved 
through the spectrum. The reason for this is twofold: First, it turns 
out that varying the window width tends to skew the resulting SNR 
and 1/pvaiue curves, mainly due to overlap effects as the window 
grows in size towards higher frequencies, and this inevitably intro- 
duces an unwanted offset in the peak position of the detectability, 
i.e., in the estimated ^ max . Second, use of a single varying window 
does not take into account that the selected (notionally optimal) 
window size may not be optimal in practice, as noted above. 

Application of the MWPS as outlined above yields an SNR 
(or equivalently PBR) curve and a corresponding 1/pvaiuc curve, 
as a function of frequency for each of the adopted 40 window 
widths. An ensemble analysis of those curves which have values 
of l/j3vaiuc that exceed the detection threshold then yields an es- 
timate of fmax- We discuss this procedure in detail in Section 4 
below. 

We then apply a sanity check on the results. The total power 
in the oscillations spectrum is a strong function of i/ mB , i.e., the 
higher ^ max , the smaller is the expected total oscillations power. 
We apply the following formulae from Chaplin et al. (2011) to es- 
timate the expected total power, based on the estimated i/ max : 



1.55A max ?7 
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Here P to t is the total power underneath the Gaussian shaped 
envelope in the range ±f max /2 around the estimated f ma x value, 
and A max is the radial-mode amplitude at f ma x- The factor rf is 
a correction for the apodization of the oscillation signal the closer 



c max is to the Nyquist frequency, um v , and Av is the average large 
separation (see discription to Eq. 1 0) which may be estimated from 
i/ nlal (see Eq. 14). This prediction may then be compared to the 
measured excess power in the spectrum. If the measured excess is 
significantly higher than the predicted excess (i.e., by some multi- 
ple of the combined uncertainties on the prediction and measure- 
ment) the candidate detection is flagged as questionable. Strong, 
narrow-band artefacts can produce signatures of this type, hence 
knowledge of known instrumental issues is important to help try to 
verify the robustness (or otherwise) of any claimed detection. 

2.4 Fitting the background 

Since the success of the MWPS method relies heavily on a good 
fit of the background, we devote this section to a description of the 
background fitting process. 

We took the approach of a \ 2 -minimization instead of directly 
maximising the likelihood function of the power spectrum (having 
a xl -statistic) mainly to decrease the computation time for the fit- 
ting. In order to use a \ 2 procedure the power spectrum is binned 
linearly in ln(V). This ensures first of all that the points used in the 
background fit are independent and secondly, that the density of 
points in frequency is larger towards lower frequencies where the 
bulk of variations due to background phenomena occur. The aver- 
aged datum with the smallest number of binned frequencies used in 
the fitting tests (Section 4) came from an average made over ~ 550 
frequency bins, which makes it safe to assume that the points will 
have a normal distribution, 5 justifying the use of a \ 2 -minimisation 
scheme: 



X' 



= E 

_ bg(xi;a 



y/n 



(7) 



(8) 



Here the sum is over the binned data, yi is the value of the binned 
datum i. bg(xi;n) is the value of the fitted background function, 
where a are the dependent variables for the background function. 
The weighting, given by <Ji, depends as seen on the number of fre- 
quencies, n, included in the binning. 

The function describing the background signal consists of 
power laws (Harvey 1985), each of which describes a specific phys- 
ical phenomenon. The power laws included describe the signal due 
to, respectively, stellar activity and granulation, given by (see, e.g., 
Hekker et al. 2010; Mathur et al. 201 1): 



(Xi;a) = 



+ 



+ S N . 



(9) 



1 + {Bgx.y 1 + (B a x 

In this equation A g and A a give the power of granulation and activ- 
ity respectively, while B g and B a represent the characteristic time 
scales for the two phenomena as the power remains approximately 
constant on time scales longer than B g / a (or equivalently frequen- 
cies lower than B~, a ), and drops off for shorter time scales. In the 
fitting, the exponent, c, of the granulation slope is kept as a free 
parameter, while the exponent for the activity slope is fixed to a 
value of 2. The fixed value of 2 for the slope of the activity com- 
ponent arises from the fact that we assume an exponential decay of 
the activity regions as a function of time . The constant parameter 



4 Estimated from tests on toy models of the power spectrum with vary- 
ing values of e.g. the shot noise and the parameters descibing the Gaussian 
power envelope 



5 With k > 50 a normal distribution can be assumed (e.g. Box, Hunter and 
Hunter 2005) 

6 The fourier transform of an exponentially decaying function is a 
Lorentzian function, i.e. a power law as in Eq. 9 with an exponent of 2 
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0.29 




Frequency [^Hz] 

Figure 1. Illustration of the impact of applying the runs-test, here shown 
on artificial noiseless data shown as the limit power spectrum in grey. This 
limit spectrum is modelled as a series of Lorentzian peaks with a regu- 
lar frequency spacing and a height modulated by a Gaussian envelope (see 
Eq. 10), which sits on top of a slowly varying background function. The 
points in the figure are the values for the binned spectrum. The dashed black 
curve shows the background fit to all the binned points, and as seen the 
fit is somewhat offset from the expected level. The green points mark the 
longest consecutive series of points above the background curve. The solid 
black curve gives the best fit to the red points, i.e., after removing the points 
flagged by the runs-test (green) along with the additional points on either 
side, given by the blue points. 



Sn gives the photon shot noise. Whilst the method was tested we 
omitted binned data from the first 250 /iHz of the power spectrum, 
so that only the granulation and the shot noise was needed to fit the 
background. The reason for only including these two components is 
that phenomena related to activity, such as e.g. spots (both intrinsic 
variations and rotational modulation), mesogranulation and super- 
granulation all have characteristic time scales longer than 4000 s 
(see, e.g., Harvey 1985), equivalent to frequencies below 250 ^iHz. 

A general problem when fitting the background to the power 
spectrum is of course that any excess power from stellar oscillations 
will offset the background fit, primarily through an overestimated 
value of the constant shot noise component, and make it very diffi- 
cult to pin down the correct value for the slope of the background 
function (see Fig. 1) which again could produce unwanted peaks in 
the calculated SNR curves. As this offset will lower the PBR value 
it can have a determining impact on the performance of the MWPS 
method, especially in high noise cases. 

To boost the robustness of our method and circumvent the 
offset from a potential power excess, a Wald-Wolfowitz runs-test 
(see Barlow 1989) is applied to the intial background fit to check 
for non-random offsets around the fitted curve. This type of in- 
formation in not obtainable from the fitting procedure as the \ 2 ' 
minimization is, by its very nature, completely blind to the sign 
of any deviation, whereas the Wald-Wolfowitz runs-test is blind to 
anything but the sign of the offset making it a good complimentary 
test. Any power excess which is strong enough to offset the back- 
ground fit will be distributed in such a way that there will be a rela- 
tively long run (i.e., a consecutive series) of points above the fitted 
curve at the position of the power excess (see Fig. 1). If the false- 
alarm probability obtained from the runs-test falls below a given 
value - indicating that the chances of the run occuring by chance 



are low - it may be assumed that the background fit is offset due to 
some power excess not described by the background function. 

The points comprising the longest consecutive series above 
(in power) the background fit are removed in addition to a certain 
user defined number of points at the low- and high-frequency ends 
of this series ' , and the background is fitted anew to the remaining 
binned points. In our testing we generally removed seven additional 
points at the low-frequency end, and three at the high-frequency 
end (remember that as the binned points are linear in \n(u) the dis- 
tance in frequency between two points at the low-frequency end 
is much shorter than the equivalent distance at the high-frequency 
end). 

If the best-fitting \ 2 -value remains high after this procedure 
has been applied, an extra power law could potentially be included 
in the background fit until a satisfactory fit is obtained. A good fit 
corresponds to a reduced x 2 -value of Xr ~ !■ If tne reduced \ 2 - 
value is lowered after the removal of the longest run from the runs- 
test, the MWPS procedure continues, even if this removal of points 
worsens the result from the runs-test. In this sense the probabilities 
from the runs-test are used more as indicators of a good fit, while 
the configuration of runs is actively used. We employed a Nelder- 
Mead simplex 8 algorithm for the minimisation of the \ 2 -value, as 
this seems to perform in a very robust manner albeit at the expense 
of being somewhat slower than other frequently applied methods, 
e.g. steepest descent methods. 

The starting values of the parameters of the background model 
were found as follows: For the granulation parameters - i.e., the 
granulation time scale B g and amplitude A g - we use informa- 
tion from the first derivative of the binned power spectrum in log- 
arithmic units. This derivative is found by applying a third-order 
Savitzky-Golay 9 (Savitzky & Golay 1964) smoothing filter to the 
binned power spectrum, from which the first derivative is directly 
obtainable. For the Savitzky-Golay filter we chose a window width 
of 41 points. In a first derivative curve like the one found, peaks 
will correspond to a rise or fall (i.e., a change) in power spectral 
density as a function of frequency. As mentioned in connection to 
Eq. 9, the contribution from granulation in the power spectrum falls 
at frequencies above B" 1 . The granulation time scale is expected 
to be shorter than the equivalent time scale for the activity and so 
we identify the highest frequency peak in the first derivative curve 
as originating from the change in power due to the granulation, 
from which one can readily find the associated time scale. For the 
amplitude A g we find, in the first derivative curve, the frequency of 
the first dip to the low-frequency side of the granulation timescale 
peak, and use the power spectral density in the power spectrum at 
this frequency as the estimate for A g . The shot noise level is esti- 
mated from the mean power spectral density at the high-frequency 
end of the power spectrum. Even though the Nelder-Mead algo- 
rithm is intended for localised optimisation, we found no significant 
change in the resulting background fits when the starting values for 



We set the constraint that the series must have a length of at least three 
points 

8 This algorithm along with the rest of the code is written in the program- 
ming languages PYTHON (http://www.python.org) and CYTHON 
(http : / /cython .org) 

9 In a Savitzky-Golay filter one makes for every data point, [xi,fi), a 
least-squares fit of a polynomial function (of a given order) to the data points 
within a window around this datum. The resulting smoothed value at Xi, is 
then the value of the polynomial at this position. In this sense the smoothing 
is performed in a moving window 
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Figure 2. Example background fit for one of the a = 280 ppm realizations. The 1 /^Hz smoothed power spectrum is plotted in gray, from 230 to 8500 /xHz. 
The red points show the binned power spectrum. The longest run found by the runs-test is shown with the green points, and the additional points to be removed 
are plotted in blue. Also shown is the fitted background function before (magenta) and after the exclusion of the green and blue points (black). 



the optimisation were varied slightly, which is to be expected given 
the low number of fitting parameters. 



3 COMPARISON METHOD 

We have compared the MWPS method against the power- spectrum- 
of-power-spectrum, or PS®PS, method. For solar-like oscillators 
the PS®PS method works well because high-order p-modes fol- 
low to good approximation a regular, periodic pattern in frequency 
(Tassoul 1980), described by 

u„, e =Au(n+ h + e)-£(£ + 1)D, (10) 

where n is the radial order and £ is the angular degree. Av is the so- 
called large separation, corresponding to the spacing in frequency 
between consecutive radial overtones, n, having the same angu- 
lar degree, £. e is a constant that is sensitive largely to the surface 
layers, and D depends upon the sound-speed gradient in the deep 
interior of the star. 

A near-regular pattern of modes in the power spectrum will 
produce significant peaks in the PS®PS at locations corresponding 
to the prominent frequency separations. The most and next-most 
significant peaks are usually those corresponding to the separations 
Au/2 and Av/4, respectively (the former corresponding to the av- 
erage separation between modes of odd and even degree, £). Before 
the PS®PS is computed the power spectrum is first corrected for 
the background, by division of the best-fitting background model 
given by the procedure described in Section 2.4 above. The PS®PS 
spectrum is then calculated from a window of the power spectrum 
centred on f m „, as per the MWPS method. For our tests here we 
set the frequency window in power spectrum in advance, centered 



on the true location of ^ ma x (rather than run a moving window 
through the spectrum). 

The statistics of the individual and independent bins in the 
PS®PS are also described to excellent approximation by \ 2 2-d.o.f. 
distribution. The probability of observing a relative power in a sin- 
gle bin greater than or equal to s (i.e. the p- value, with s the height 
divided by the background level of the PS®PS) is just: 

p(s) = exp(-s). (11) 

We may then determine the probability of observing a single bin 
with a value of s by chance out of N bins of the PS®PS by 

Pchancc = [1-P(S)P, (12) 

where £ = 1 for a non-oversampled PS®PS and £ ~ 3 for an over- 
sampling by a factor of 10 (Chaplin et al. 2002). The final probabil- 
ity of observing a value greater than or equal to s amongst bins 
in the PS®PS spectrum (i.e., not by chance) is : 

Pvaluc = 1- [1- P (S)P. (13) 

Again, if the value of p V ai uc falls below a = 0.01 the case is 
flagged as a possible detection of stellar oscillations. It is clear from 
Eq. 13 that the number of bins N used in the computation is cru- 
cial in determining the absolute value of p V aiuc- Taking N to be the 
total number of bins in the PS®PS would only yield detections in 
very high SNR cases. However, we may use a priori knowledge on 



10 An alternative approach would be to calculate the joint probability of 
observing the three highest values in the intervals around Au/2, Au/4 and 
A1//6 (see Hekker et al. 2010). One could also calculate the probability 
of observing the sum of points in the interval around the assumed Au/2, 
much as the approach followed in the MWPS method 
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(a) SNR curves. (b) l/p va lue curves. 

Figure 3. A typical result from an MWPS run, here for a realisation with a = 280 ppm noise case. Panel (a) shows SNR curves for a representative sample 
of the 40 window widths (with widths indicated on the legend). Panel (b) shows the corresponding l/p va lue curves. The red dashed line marks the one per 
cent significance level while the dashed blue line marks the five per cent significance level. 



stellar oscillations to restrict the adopted value of TV. The value of 
Av scales with v max (Stello et al. 2009) according to 

Av = a(^ max /VHz) /3 (14) 

with values of a = 0.22 and /3 = 0.797 (Huber et al. 2011). Pre- 
dictions from stellar evolutionary models suggest that the value of 
Av given by the relation is accurate to around 15 per cent, which is 
the uncertainty we therefore adopt to define the search range around 
the Av/2 value (having adopted the centre of the frequency win- 
dow in the power spectrum as the proxy for f max ). This decreases 
the value of TV significantly, compared to the total number of bins 
in the PS®PS, and ensures that the method performs as effectively 
as possible when compared to the MWPS method. 



4 MWPS AND PS®PS TESTS 

Both methods have been tested using a 500-day-long time series of 
photometric data on the Sun, obtained in the green band (500nm) 
of the VIRGO-SPM instrument (a full-disc Sun photometer) on 
board the ESA/NASA SOHO spacecraft. This time series has a 
duty cycle of nearly 99 per cent (Roca Cortes et al. 1999), which, 
combined with its long duration, makes it safe to assume inde- 
pendent frequency bins in the power spectrum with xl statistics. 
To this timeseries were added different levels of random Gaus- 
sian noise, with sample standard deviations covering the range 
a — {180 ppm, 200 ppm, . . . , 280 ppm}, resulting in a total of 
six tested SNR levels. For each noise level we generated 20 power 
spectra, each having a different realization of the added random 
noise. 

For all the power spectra we omitted the first 250 /iHz in the 
fit of the background, and found sufficiently good fits (in the sense 
of a low x 2 -value) when including only the granulation and shot 
noise components. The chosen lag in frequency of the MWPS was 
~ 2.32 /iHz (equivalent to 100 frequency bins), which gives very 
good resolution. The bin size for the background binning was cho- 
sen as Abin = 0.05 (linear in In(i/)). The envelope width of power 
excess was for all v max proxies was taken to be 0.59-times i/ max . 



An example of one of the background fits is plotted in Fig. 2. 
It shows results on one of the realisations with a = 280 ppm. The 
plotted power spectrum has been smoothed with a 1 /iHz-wide box- 
car filter. The red points show the binned spectrum, to which the 
background function was fitted. The region with the longest run de- 
tected is shown by the green points, while the blue points show the 
additional points removed on the low- and high-frequency sides of 
the longest run. We also show the background fits to the points be- 
fore (magenta curve) and after (black curve) the removal of the blue 
and green points. While the change in the background fit is quite 
subtle and hardly visible in this case, the reduced \ 2 did change 
favourably, from a value of Xr = 1-24 with 66 d.o.f. to Xr = 1-06 
with 51 d.o.f. Fig. 3 shows the MWPS results for the data plotted in 
Fig. 2. Panel (a) shows the SNR (i.e., PBR — 1) curves for several 
different window sizes (see plot anotation). The window sizes used 
here were selected as described in Section 2.3. A value of zero on 
the ordinate in effect corresponds to the total absence of oscillation 
signal. Negative values mean that the background was estimated 
to be higher than the actual observed power. Panel (b) shows the 
corresponding 1 /p va iue curves. The dashed red line marks the one 
per cent significance level while the dashed blue line marks the five 
per cent significance level. Note that even though the most narrow 
window dominates the SNR curve its 1 /p va iuo is not as prominent 
due to the fact that its window comprises a smaller number of d.o.f. 
compared to the other windows. 

We estimate f max from the l/p va iue curves as follows. We be- 
gin by calculating a mean l/p va iue curve from all curves in which 
a detection was flagged. Using the mean curve instead of just the 
individual l/p va tue curves ensures that if there are spurious peaks 
(e.g. from very narrow windows) these will not have a large impact 
as they are not seen in wider windows. The mean 1 /p va i uc curve is 
then smoothed with a Savitzky-Golay filter (which ensures that the 
area under the curve, the peak positions, and the width/height ratio 
of peaks, is conserved), with a window width of 61 points (corre- 
sponding to ~ 142/iHz, with an adopted lag frequency of about 
2.32/^Hz). The estimate of v m&K is then given by the frequency 
at the highest value in this smoothed curve. To estimate the con- 
fidence limits on the estimated f max we treat the smoothed, mean 
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Figure 4. Smoothed average l/p va i uc curve, again for one of the 
cr = 280 ppm realizations. The solid black line gives the estimated i/ ma x, 
while the two dashed black lines represent the confidence interval (equiva- 
lent 1-er interval). The dashed red and blue lines correspond, respectively, 
to the one and five per cent significance levels. 



1/fvaiue curve as a probability density curve. Here the confidence 
limits are found by taking the 68.27% confidence intervals around 
i/ mal (see Barlow 1989). This is calculated by finding the shortest 
interval around f ma x wherein 68.27% of the area under the curve 
is contained, equivalent to a one sigma limit for a Gaussian prob- 
ability distribution. An example of an average 1 /p V aiue curve with 
estimated confidence limits can be seen in Fig. 4. 

For the computation of the significance in the PS®PS we used 
the same background fits for correcting the power spectrum as were 
used for the MWPS method. For the search range, the adopted vari- 
ation in the value of Af/2 of ±15 per cent gave N — 13 bins to be 
used in Eq. 13. The background level in the PS®PS spectrum (not 
the power spectrum) was found by first smoothing the PS®PS spec- 
trum with a median filter. A low-order polynomial was then fitted to 
the smoothed spectrum, where the search range around Ai//2 was 
excluded from the fit. The PS®PS was then divided by the fitted 
polynomial curve, giving relative power in the PS®PS spectrum. 

Fig. 5 shows non-oversampled and oversampled PS®PS re- 
sults for a = 180 and a — 280 ppm noise cases (the latter for the 
same data as Figs. 2, 3 and 4). The coloured regions in the figure 
show the range tested for significant power. Results are shown at 
both noise levels to illustrate the degradation of the PS®PS sig- 
nal in going from the lowest to the highest noise case considered. 
The a — 280 ppm signal may be compared with the firm detection 
found at this noise level by the MWPS method. 



4.1 Results 

Panel (a) of Fig. 6 shows the 1 /p V aiuc results from the MWPS tests. 
A detection is flagged as positive if the l/p V aiue is greater than 100 
for a = 0.01 or 20 for a = 0.05, corresponding to the dashed red 
and blue lines respectively. For each of the noise levels the indi- 
vidual realisations are plotted with empty circles, and the median 
values for each of the noise realisations are marked by the red pen- 
tagons. The results for the PScgiPS method are shown in panels (b) 
and (c). Panel (b) shows the results from non-oversampled PS®PS 
spectra, while panel (c) shows the results when oversampling by a 
factor 10 was used. 



It is apparent that the MWPS returns a higher detection rate 
than the PS®PS method, i.e., not only are the median detection 
rates higher, but there are also a much smaller fraction of non- 
detections. The MWPS also shows a much larger spread in the in- 
ferred 1/pvaiucs, but as the values found in general give positive 
detections at a high significance level this is not a major cause for 
concern. 

Since in our test data we know the location in frequency of the 
oscillations, we are also in a position to say something about the 
occurrence rate of false positive detections. Fig. 7 plots the posi- 
tion in frequency of flagged detections (at the one per cent level). 
The noise levels are indicated by the colour coded legend. The solar 
value of z^max© = 3100 fiHz is marked by the dashed red line. The 
lines at 2325 and 3875 /iHz mark the interval we set for a correct 
positive detection. This interval corresponds to a one standard de- 
viation of a Gaussian power envelope having f max = 3100 piHz. 
Points that fall in the shaded areas, outside this frequency range, 
might therefore be considered as putative false-positive detections. 
Only six realisations over the entire ensemble fell well within the 
shaded areas, three of which were from the highest noise case. Ex- 
cept for the false-positive in the a = 260 ppm case we find that 
in the remainder of the false-positive cases the MWPS is still pick- 
ing up part of the underlying signal, but the central frequencies 
are offset due to beating with the noise. These cases could right- 
fully be seen as noise-affected detections rather than genuine false- 
positives. This suggests that even under quite extreme SNR condi- 
tions, the MWPS may still be used to ascertain if excess power is 
present, but the estimate for the actual v mBX value should not be 
trusted in very high noise cases. 

When at each noise level we calculate the difference between 
the estimated f ma x and the true, underlying i/ maX 0, in units of 
the estimated uncertainties, we find that the fraction of detections 
that lie within lcr of the true value stay fairly constant, at around 
50 per cent. However, the average uncertainties do of course in- 
crease, reaching about 5 per cent at a noise level of cr = 260. To put 
this fractional precision in the context of stellar properties estima- 
tion, consider, for example, the use of f ma x to estimate log((?), via 
the scaling relation ^ max oc gT~^ 5 (see, e.g., Brown et al. 1991; 
Kjeldsen & Bedding 1995). Under the assumption of superior pre- 
cision in any complementary estimate of T e s, a fractional precision 
of 5 per cent in i/ ma x equates to a similar fracitonal precision in g, 
and a precision of about 0.02 dex in \og(g). 

With a fraction of non-detections of only 10 per cent at a — 
280 ppm we find (two realisations fall below detection at the 1 per 
cent level), as mentioned above, that the MWPS can be utilised at 
even higher noise levels to ascertain if excess power is present. 

In most of the false-positive cases it is clear that there is an 
asymmetry in the estimated uncertainties on the central detection 
frequency, with the larger uncertainty pointing towards the true 
i/ max value. In the bulk of these cases the asymmetry is caused by 
multiple peaks with nearly the same height in the mean l/jJ V aiues 
curve (supporting our contention that beating with the noise is to 
blame), where the highest peak happens to be at the low frequency 
end. The fact that the uncertainty goes mostly towards the true i/ mH 
value also supports our claim that part of the underlying signal is 
being picked up. 

From looking at the results presented in Fig. 5, it is clear 
that, in the sense of determining an accurate estimate for f ma x, 
the PS®PS method will be much better. This can be seen from the 
fact that the l/p V aiucs peaks lie, in frequency, at the position of the 
Af/2 value. The application of Eq. 14 would therefore return a 
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(a) Non-oversampled PS®PS spectrum, a = 180 ppm noise. (b) PS0PS spectrum oversampled by a factor of 10, a = 180 ppm noise. 
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(c) Non-oversampled PS® PS spectrum, a = 280 ppm noise. 
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(d) PScgiPS spectrum oversampled by a factor of 10, a = 280 ppm noise. 



Figure 5. Results of applying the PSCgiPS method to one of the a = 180 and a = 280 ppm noise realizations. The spectra shown in this figure, each have 
a l/p va i U e that lies close to the median value for the respective noise levels, see Fig. 6. Panel (a): l/p va i uc for non-oversampled PSS5PS spectrum of the 
background corrected power spectrum. Here, the PS®PS spectrum has been divided through by a smoothed background curve, and the l/p V aluc has been 
calculated using Eq. 13. The shaded area shows the ±15 per cent interval around Av/2 which is used as the search range for significant power. Panel (b): 
oversampled result for the same dataset. Panels (c) and (d): results of applying the PSC3PS method to one of the <r = 280 ppm noise realizations (same data 
used to generate MWPS results in Fig. 3). The red and blue dashed lines mark the one per cent and five per cent significance levels respectively. The dashed 
vertical lines give the position, in frequency, of Au/2, Av/A, and Au/6. 



value for ^ max very close to the true value (under the assumption 
that the correct peak is used for the computation). 



5 CONCLUSION 

We have demonstrated that the MWPS method provides an efficient 
way of detecting excess power from solar-like oscillations under 
low signal-to-noise conditions, and that it also outperforms the fre- 
quently applied PS®PS method. It is worth noting that while the 
efficiency of the PS® PS will be affected by departures from near 
regularity of the mode spacings in frequency (e.g., in evolved solar- 
type stars showing mixed modes, or in stars showing significant 
variations of Av with frequency), such departures will not affect 
the performance of the MWPS method. 

It is clear that the method is well suited as the initial step in the 



search for stellar oscillations in low SNR targets, and can provide 
useful information and guidelines as to where one should look more 
carefully for signatures of the near-regular frequency separations of 
the modes. It should also be evident that the greatest challenge for 
this otherwise relatively simple method is that of obtaining a re- 
liable background fit, since inferences drawn rely on the implicit 
assumption that the fitted background is close to the actual, under- 
lying background. We found that the introduction of the runs-test 
provided an extremely useful additional constraint to assess the ro- 
bustness of the background fitting. 
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Figure 6. Results from the 20 different realisations of the a = 180 ppm to cr = 280 ppm noise cases. Panel (a) shows the results obtained from the MWPS 
method, while panels (b) and (c) show the corresponding results from the PS®PS method (no oversampling, and over-sampling by a factor of 10). The values 
in the MWPS plot, given by open symbols, are the maxima l/p V alue of the smoothed, average l/p V alue curves for the individual realisations. In all panels 
the open symbols show results of the individual realisations, while the red symbols are the median l/p va i ue value over all realisations at each noise level. The 
red and the blue dashed lines show the threshold values for, respectively, a one per cent and a five per cent detection threshold. 




Figure 7. Estimated i/ max values from the MWPS method, for different realizations of each noise level (see plot legends). Note that for the cr = 260 and 280 
ppm only 1 8 points are plotted. This is due to the fact that both of these noise levels have two realisations which fall below detection at the 1 per cent level, as 
also indicated in Fig. 6 
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